Scaling Embeddings with Feast and KubeRay
โกONNX Runtime
Flag this post
Machine Learning and CPU (Central Processing Unit) Scheduling Co-Optimization over a Network of Computing Centers
arxiv.orgยท2d
๐NCCL
Flag this post
Opportunistically Parallel Lambda Calculus
๐กLSP
Flag this post
Rethinking Networking for the AI/ML Era
lukew.comยท1d
๐NCCL
Flag this post
Show HN: Using GitHub Pages as zero-cost APT repository with global CDN
๐๏ธBuild Systems
Flag this post
A portable picokernel for async I/O
๐Profiling Tools
Flag this post
Accelerating AI inferencing with external KV Cache on Managed Lustre
cloud.google.comยท1d
โกONNX Runtime
Flag this post
Challenging the Fastest OSS Workflow Engine
๐งPTX
Flag this post
Research roundup: 6 cool science stories we almost missed
arstechnica.comยท2h
๐ONNX
Flag this post
Should I build a cluster?
๐๏ธBuild Optimization
Flag this post
**Adaptive Algorithmic Profiling & Resource Allocation via Dynamic Markov Chain Optimization**
โกONNX Runtime
Flag this post
Show HN: GPU-accelerated sandboxes for running AI coding agents in parallel [video]
๐NCCL
Flag this post
KAITO and KubeFleet: Projects Solving AI Inference at Scale
thenewstack.ioยท1d
๐MLOps
Flag this post
Speedrunning an RL Environment
๐TorchScript
Flag this post
How my NAS taught me to stop trusting the cloud blindly
xda-developers.comยท53m
๐Nsight
Flag this post
Loading...Loading more...